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SEQUENCE LISTING 

4 (1) GENERAL INFORMATION: 

6 (i) APPLICANT: Yu, Guo-Liang 

7 Rosen, Craig 

9 (ii) TITLE OF INVENTION: Colon Specific Genes and Proteins 

11 (iii) NUMBER OF SEQUENCES: 24 

13 (iv) CORRESPONDENCE ADDRESS: 

14 (A) ADDRESSEE: Carella, Byrne, Bain, Gilfillan, Cecchi, 

15 Stewart & Olstein 

16 (B) STREET: 6 Becker Farm Road 

17 (C) CITY: Roseland 

18 (D) STATE: NJ 

19 (E) COUNTRY: USA 

20 (F) ZIP: 07068-1739 

22 (v) COMPUTER READABLE FORM: 

2 3 (A) MEDIUM TYPE: Floppy disk 

24 (B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC- DOS/MS -DOS 

2 6 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 

2 8 (Vi) CURRENT APPLICATION DATA: 

C--> 29 (A) APPLICATION NUMBER: US/09/988 , 292A 

C--> 30 (B) FILING DATE: 19-NOV-2001 

31 (C) CLASSIFICATION: 

3 3 (viii) ATTORNEY/AGENT INFORMATION: 

34 (A) NAME: Ferraro, Gregory D. 

.35 (B) REGISTRATION NUMBER: 36,134 

3 6 (C) REFERENCE/DOCKET NUMBER: 325800-435 

3 8 (ix) TELECOMMUNICATION INFORMATION: 

39 (A) TELEPHONE: 201-994-1700 

40 (B) TELEFAX: 201-994-1744 

4 3 (2) INFORMATION FOR SEQ ID NO : 1: 
4 5 (i) SEQUENCE CHARACTERISTICS: 

46 (A) LENGTH: 638 base pairs 

47 (B) TYPE: nucleic acid 

48 (C) STRANDEDNESS : single 

4 9 (D) TOPOLOGY: linear 
51 (ii) MOLECULE TYPE: cDNA 

54 (ix) FEATURE: 

55 (A) NAME/KEY: mat_peptide 

56 (B) LOCATION: 1. . 501 
58 (ix) FEATURE: 

5 9 (A) NAME/KEY: CDS 

60 (B) LOCATION: 1..501 
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63 




(xi) SEQUENCE DESCRIPTION: 


SEQ 


ID NO: 1 














64 


GCC 


AGG 


CAG 


CTG 


GCT 


GCC 


SAC 


CAG 


GCC 


GTG 


TAT 


GTG 


AAG 


GTC 


AAG 


GCT 


48 


W--> 65 


Ala 


Arg 


o;j_n 


Leu 


Ala 


Ala 


Xaa 


Gin 


Ala 


val 


Tyr 


Val 


Lys 


Val 


Lys 


Ala 




66 


1 








0 










10 










15 






68 


GAA 


GCC 


CGG 


GAA 


CTG 


CTG 


GGC 


CAC 


CCG 


TGG 


TCT 


CTG 


TGT 


CCT 


GTC 


TGT 


96 


69 


Glu 


Ala 


Arg 


Glu 


Leu 


Leu 


Gly 


His 


Pro 


Trp 


Ser 


Leu 


Cys 


Pro 


Val 


Cys 




70 








20 










25 










30 








72 


GGG 


TGC 


CAA 


CTC 


ACC 


ACC 


TTT 


GAT 


GGG 


GCC 


CGT 


GGT 


GCC 


ACC 


ACT 


CTC 


144 


73 


Gly 


Cys 


Gin 


Leu 


Thr 


Thr 


Phe 


Asp 


Gly 


Ala 


Arg 


Gly 


Ala 


Thr 


Thr 


Leu 




74 






35 










40 










45 










76 


CTG 


GTG 


TCT 


ATG 


AAG 


CTC 


TCT 


TCC 


CGC 


TGC 


CCA 


GGA 


CTA 


CAG 


AAT 


ACC 


192 


77 


Leu 


Val 


Ser 


Met 


Lys 


Leu 


Ser 


Ser 


Arg 


Cys 


Pro 


Gly 


Leu 


Gin 


Asn 


Thr 




78 




50 










55 










60 












80 


ATC 


CCC 


TGG 


TAC 


CGT 


GTA 


GTT 


GCC 


GAA 


GTC 


CAG 


ATC 


TGC 


CAT 


GGC 


AAA 


240 


81 


lie 


Pro 


Trp 


Tyr 


Arg 


Val 


Val 


Ala 


Glu 


Val 


Gin 


He 


Cys 


His 


Gly 


Lys 




82 


65 










70 










75 










80 




84 


ACG 


GAG 


GCT 


GTG 


GGC 


CAG 


GTC 


CAC 


ATC 


TTC 


TTC 


CAG 


GAT 


GGG 


ATG 


GTG 


288 


85 


Thr 


Glu 


Ala 


Val 


Gly 


Gin 


Val 


His 


He 


Phe 


Phe 


Gin 


Asp 


Gly 


Met 


Val 




86 










85 










90 










95 






88 


ACG 


TTG 


ACT 


CCA 


AAC 


AAG 


GGT 


GTG 


TGG 


GTG 


AAT 


GGT 


CTC 


CGA 


GTG 


GAT 


336 


89 


Thr 


Leu 


Thr 


Pro 


Asn 


Lys 


Gly 


Val 


Trp 


Val 


Asn 


Gly 


Leu 


Arg 


Val 


Asp 




90 








100 










105 










110 








92 


CTC 


CCA 


GCT 


GAG 


AAG 


TTA 


GCA 


TCT 


GTG 


TCC 


GTG 


AGT 


CGT 


ACA 


CCT 


GAT 


384 


93 


Leu 


Pro 


Ala 


Glu 


Lys 


Leu 


Ala 


Ser 


Val 


Ser 


Val 


Ser 


Arg 


Thr 


Pro 


Asp 




94 






115 










120 










125 










96 


GGC 


TCC 


CTG 


CTA 


GTC 


CGC 


CAG 


AAG 


GCA 


GGG 


GTC 


CAG 


GTG 


TGG 


CTT 


GGA 


432 


97 


Gly 


Ser 


Leu 


Leu 


Val 


Arg 


Gin 


Lys 


Ala 


Gly 


Val 


Gin 


Val 


Trp 


Leu 


Gly 




98 




130 










135 










140 













100 GCC AAT GGG AAG GTG GCT GTG ATT GTG AGC AAT GAC CAT GCT GGG AAA 480 

101 Ala Asn Gly Lys Val Ala Val He Val Ser Asn Asp His Ala Gly Lys 

102 145 150 155 160 

104 CTG TGT GGG GGC CTK TGG AAA ATTTGACGGG GGACCAGACC AATGATTGGG 531 

105 Leu Cys Gly Gly Xaa Trp Lys 

106 165 

108 ATGATTCCCA GGAGAAGCCA GCGATTGGGG AAWTGGAGAG CGCAGGGACT TTCTYCCMCA 591 
110 TGTTAATGGG CTTGWTCCAG TTCATCCCAC CAGGAACGAA GGATTTT 638 
113 (2) INFORMATION FOR SEQ ID NO : 2: 

115 (i) SEQUENCE CHARACTERISTICS: 

116 (A) LENGTH: 167 amino acids 

117 (B) TYPE: amino acid 

118 (D) TOPOLOGY: linear 
120 (ii) MOLECULE TYPE: protein 

122 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

W--> 124 Ala Arg Gin Leu Ala Ala Xaa Gin Ala Val Tyr Val Lys Val Lys Ala 
125 15 10 15 

12 7 Glu Ala Arg Glu Leu Leu Gly His Pro Trp Ser Leu Cys Pro Val Cys 
128 20 25 30 

130 Gly Cys Gin Leu Thr Thr Phe Asp Gly Ala Arg Gly Ala Thr Thr Leu 

131 35 40 45 
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133 


Leu 


Val 


Ser 


Met 


Lys 


Leu 


Ser 


Ser 


Arg 


Cys 


Pro 


biy 


Leu 


bin 


Asn 


Thr 


1 "3 A 

134 




DV 




















D U 










136 


lie 


Pro 


Trp 


Tyr 


Arg 


Val 


Val 


Ala 


blU 


vai 


bin 


T "J — 

lie 


Cys 


IT -i fi 
HIS 


biy 


Lys 


1 3 1 


0 D 










/ u 




















o V 


ion 

139 


Thr 


Glu 


Ala 


Val 


Gly 


bin 


-IT.-. 1 

Val 


TT-! n 

HIS 


Ti- 
ne 


pne 


pne 


bin 


Asp 


biy 


Met 


vai 


140 










0 D 










y u 










y d 




142 


Thr 


Leu 


Thr 


Pro 


Asn 


Lys 


Gly 


Val 


Trp 


vai 


Asn 


biy 


Leu 


Arg 


vai 


Asp 


143 








100 










105 










11U 






145 


Leu 


Pro 


Ala 


Glu 


Lys 


Leu 


Ala 


Ser 


Val 


Ser 


Val 


Ser 


Arg 


Thr 


Pro 


Asp 


146 






115 










120 










Izj 








148 


Gly 


Ser 


Leu 


Leu 


Val 


Arg 


Gin 


Lys 


Ala 


Gly 


vai 


Gin 


Val 


Trp 


Leu 


Gly 


149 




130 










135 










1 A Ci 

14U 










151 


Ala 


Asn 


Gly 


Lys 


Val 


Ala 


Val 


He 


Val 


Ser 


Asn 


Asp 


His 


Ala 


Gly 


Lys 




1 A K 










150 










1 cc 
Ijj 










1 D U 


154 


Leu 


Cys 


Gly Gly 


Xaa 


Trp 


Lys 




















155 










165 
























ID / 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 


3 : 
















159 




(i) 


SEQUENCE CHARACTERISTICS : 
















160 






(A) LENGTH: 874 base pairs 














161 






(B) TYPE: 


nucleic 


acid 
















162 






(C) STRANDEDNESS : 


single 
















163 






(D) TOPOLOGY: 


linear 


















165 




(ii) 


MOLECULE TYPE: 


cDNA 


















168 




(ix) 


FEATURE : 
























169 






(A) NAME/KEY: 


CDS 




















170 






(B) LOCATION: 


1. .705 


















172 




(ix) 


FEATURE : 
























173 






(A) NAME/KEY: 


mat_peptide 
















174 






(B) LOCATION: 


1. .705 


















177 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 3 












179 


CAG 


GAC 


TGC 


GTG 


TGC 


ACG 


GAC 


AAG 


GTG 


GAC 


AAC 


AAC 


ACC 


CTG 


CTC 


AAC 


180 


Gin 


Asp 


Cys 


Val 


Cys 


Thr 


Asp 


Lys 


Val 


Asp 


Asn 


Asn 


Thr 


Leu 


Leu 


Asn 


TOT 

lol 


1 








5 










10 










Id 




183 


GTC 


ATC 


GCC 


TGC 


ACC 


CAC 


GTG 


CCC 


TGC 


AAC 


ACC 


TCC 


TGC 


AGC 


CCT 


GGG 


184 


Val 


lie 


Ala 


Cys 


Thr 


His 


Val 


Pro 


Cys 


Asn 


Thr 


Ser 


Cys 


Ser 


Pro 


Gly 


185 








20 










25 










30 






187 


TTC 


GAA 


CTC 


ATG 


GAG 


GCC 


CCC 


GGG 


GAG 


TGC 


TGT 


AAG 


AAG 


TGT 


GAA 


CAG 


188 


Phe 


Glu 


Leu 


Met 


Glu 


Ala 


Pro 


Gly 


Glu 


Cys 


Cys 


Lys 


Lys 


Cys 


Glu 


Gin 


189 






35 










40 










45 








190 


ACG 


CAC 


TGT 


ATC 


ATC 


AAA 


CGG 


CCC 


GAC 


AAC 


CAG 


CAC 


GTC 


ATC 


CTG 


AAG 


191 


Thr 


His 


Cys 


He 


He 


Lys 


Arg 


Pro 


Asp 


Asn 


Gin 


His 


Val 


He 


Leu 


Lys 


192 




50 










55 










60 










194 


CCC 


GGG 


GAC 


TTC 


AAG 


AGC 


GAC 


CCG 


AAG 


AAC 


AAC 


TGC 


ACA 


TTC 


TTC 


AGC 


195 


Pro 


Gly 


Asp 


Phe 


Lys 


Ser 


Asp 


Pro 


Lys 


Asn 


Asn 


Cys 


Thr 


Phe 


Phe 


Ser 


196 


65 










70 










75 










80 


198 


TGC 


GTG 


AAG 


ATC 


CAC 


AAC 


CAG 


CTC 


ATC 


TCG 


TCC 


GTT 


TCC 


AAC 


ATC 


ACC 


199 


Cys 


Val 


Lys 


He 


His 


Asn 


Gin 


Leu 


He 


Ser 


Ser 


Val 


Ser 


Asn 


He 


Thr 


200 










85 










90 










95 




202 


TGC 


CCC 


AAC 


TTT 


GAT 


GCC 


AGC 


ATT 


TGC 


ATC 


CCG 


GGC 


TCC 


ATC 


ACA 


TTC 



48 
96 
144 
192 
240 
288 
336 
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z U o 


Cys 


Pro 


Asn 


irne 


Asp 


Ala 

Ala 


Ser 


Tl rt 

lie 


Cys 


lie 


Pro 


Gly 


Ser 


lie 


Thr 


Pne 




on a 
zU4 








i ft ft 










i ft ^ 

1U J 










lift 

11U 








o ft £ 
ZU 0 


7V rp fi 

AIL 


LLL 


A A rp 
AA1 


LLA 


rp f~~i 

1 LL 


ILL 


a a r* 
AAL 


ALL 


rp r~\ 

ILL 


A 

ALL 


LL1 


LLL 


AAT 


GAG 


ACC 


AGG 


"5 O A 

384 


O ft H 
Z \J 1 




Pro 


Asn 


Liy 


Cys 


Cys 


Lys 


Thr 


Cys 


Thr 


Pro 


Arg 


Asn 


Glu 


Thr 


Arg 




O ft Q 
Z U O 






11 D 










1 O ft 
IzU 










1 

IzD 










O 1 ft 


L1L 


LLL 


ILL 


1 LL 


ALL 


LI L 


LLL 


LI L 


ALL 


ALL 


LAL 


r* mm 
LI 1 


1 LL 


TAC 


GCC 


GGC 


A O O 

4 3z 


Oil 

zii 


17 a 1 

Val 


Pro 


Cys 


Ser 


i nr 


Val 


Pro 


vai 


inr 


Thr 


pl n 

LIU 


vai 


Ser 


Tyr 


Ala 


Gly 




Z1Z 




i ~i ft 










1J J 










1 A ft 

14 U 












O 1 A 

z 14 


ILL 


ALL 


A A H 

AAL 


A O O 

ALL 


LI L 


L1L 


A rp /-* 
AIL 


A A m 
AA1 


CAT 


TGC 


TCC 


GGG 


TCC 


TGC 


GGG 


ACA 


480 


T1 c 

zIj 


Cys 


Thr 


Lys 


Thr 


vai 


Leu 


Met 


Asn 


His 


Cys 


Ser 


Gly 


Ser 


Cys 


Gly 


Thr 




Z 1 0 


1 A *\ 
14 J 










i ^ft 

iJU 










Ijj 










loU 




O 1 O 

zio 


TTT 


r~i rn s~\ 

L1L 


AIL 


TAC 


rri sir** 
1 Lb 


(~y /~\ 

LLL 


AAG 


GCC 


CAG 


GCC 


CTG 


GAC 


CAC 


AGC 


TGC 


TCC 


528 


ziy 


Pne 


vai 


Met. 


Tyr 


Ser 


Ala 


Lys 


Ala 


Gin 


Ala 


Leu 


Asp 


His 


Ser 


Cys 


Ser 




O O ft 
zZU 










i ^ ^ 










i n ft 
1 / u 










175 






ZZZ 


TLL 


ILL 


AAA 


LAO 


LAL 


AAA 


ACC 


AGC 


CAG 


CGT 


GAG 


GTG 


GTC 


CTG 


AGC 


TGC 


576 


ZZ5 


Cys 


Cys 


Lys 


Glu 


Glu 


Lys 


Thr 


Ser 


Gin 


Arg 


Glu 


Val 


Val 


Leu 


Ser 


Cys 




ZZQ 








180 










185 










190 








O O £ 
ZZO 


CCC 


AAT 


GGC 


GGC 


TCG 


CTG 


ACA 


CAC 


ACC 


TAC 


ACC 


CAC 


ATC 


GAG 


AGC 


TGC 


624 


TOT 
ZZ 1 


Pro 


Asn 


Gly 


Gly 


Ser 


Leu 


Thr 


His 


Thr 


Tyr 


Thr 


His 


lie 


Glu 


Ser 


Cys 




O O Q 

z z o 






195 










200 










205 










^ Q ft 

z jU 


CAG 


TGC 


CAG 


GAC 


ACC 


GTC 


TGC 


GGG 


CTC 


CCC 


ACC 


GGC 


ACC 


TCC 


CGC 


CGG 


672 


z jl 


Gin 


Cys 


Gin 


Asp 


Thr 


Val 


Cys 


Gly 


Leu 


Pro 


Thr 


Gly 


Thr 


Ser 


Arg 


Arg 




TOO 

Z5Z 




210 










215 










220 












z j 4 


GCC 


CGG 


CGT 


TCC 


CCT 


AGG 


CAT 


CTG 


GGG 


AGC 


GGG 


TGAGCGGGGT GGGCACAGCC 


725 


O o c 

Z J J 


Ala 


Arg 


Arg 


Ser 


Pro 


Arg 


His 


Leu 


Gly 


Ser 


Gly 














0 "3 £ 


225 










230 










235 














zoo 


CCTTCACTGC CCTCGACAGC TTTACCTCCC CCGGACCCTC 


TGAGCCTCCT AAGCTCGGCT 


785 


0 jl ft 
Z 4 U 


TCCTCTCTTC AGATATTTAT TGTCTGAGTT TTTGTTCAGT 


CCTTGCTTTC CAATAATAAA 


845 


z 4 Z 


CTCAGGGGGA CATGCAAAAA AAAAAAAAA 


















874 


Z 4 D 


(2) 


INFORMATION 


FOR 


SEQ 


ID NO: 4: 


















Z4 / 




(i) 


SEQUENCE CHARACTERISTICS: 


















z 4 o 






(A) LENGTH: 235 amino 


acids 
















z4 y 






(B) TYPE: 


amino acid 




















z DU 






(D) TOPOLOGY: 


linear 




















OCT 
Z DZ 




(ii) 


MOLECULE TYPE: 


protein 




















O C *3 
ZOO 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO : 4: 














o c c 

Z JJ 


Gin 


Asp 


Cys 


Val 


Cys 


Thr 


Asp 


Lys 


Val 


Asp 


Asn 


Asn 


Thr 


Leu 


Leu 


Asn 






1 








5 










10 










15 






258 


Val 


He 


Ala 


Cys 


Thr 


His 


Val 


Pro 


Cys 


Asn 


Thr 


Ser 


Cys 


Ser 


Pro Gly 




259 








20 










25 










30 








261 


Phe 


Glu 


Leu 


Met 


Glu 


Ala 


Pro 


Gly 


Glu 


Cys 


Cys 


Lys 


Lys 


Cys 


Glu 


Gin 




262 






35 










40 










45 










264 


Thr 


His 


Cys 


He 


He 


Lys 


Arg 


Pro 


Asp 


Asn 


Gin 


His 


Val 


He 


Leu 


Lys 




265 




50 










55 










60 












267 


Pro 


Gly 


Asp 


Phe 


Lys 


Ser 


Asp 


Pro 


Lys 


Asn 


Asn 


Cys 


Thr 


Phe 


Phe 


Ser 




268 


65 










70 










75 










80 




270 


Cys 


Val 


Lys 


He 


His 


Asn 


Gin 


Leu 


He 


Ser 


Ser 


Val 


Ser 


Asn 


He 


Thr 




271 










85 










90 










95 






273 


Cys 


Pro 


Asn 


Phe 


Asp 


Ala 


Ser 


He 


Cys 


He 


Pro 


Gly 


Ser 


He 


Thr 


Phe 
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274 100 105 110 

276 Met Pro Asn Gly Cys Cys Lys Thr Cys Thr Pro Arg Asn Glu Thr Arg 

277 115 120 125 

279 Val Pro Cys Ser Thr Val Pro Val Thr Thr Glu Val Ser Tyr Ala Gly 

280 130 135 140 

282 Cys Thr Lys Thr Val Leu Met Asn His Cys Ser Gly Ser Cys Gly Thr 

283 145 150 155 160 

285 Phe Val Met Tyr Ser Ala Lys Ala Gin Ala Leu Asp His Ser Cys Ser 

286 165 170 175 

288 Cys Cys Lys Glu Glu Lys Thr Ser Gin Arg Glu Val Val Leu Ser Cys 

289 180 185 190 

291 Pro Asn Gly Gly Ser Leu Thr His Thr Tyr Thr His He Glu Ser Cys 

292 195 200 205 

294 Gin Cys Gin Asp Thr Val Cys Gly Leu Pro Thr Gly Thr Ser Arg Arg 

295 210 215 220 

297 Ala Arg Arg Ser Pro Arg His Leu Gly Ser Gly 

298 225 230 235 
300 (2) INFORMATION FOR SEQ ID NO : 5: 

302 (i) SEQUENCE CHARACTERISTICS: 

303 (A) LENGTH: 1209 base pairs 

304 (B) TYPE: nucleic acid 

305 (C) STRANDEDNESS : single 

306 (D) TOPOLOGY: linear 
308 (ii) MOLECULE TYPE: cDNA 

313 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

315 ATTGGTGCTA CCTGGCTCTC CTGTCTCTGC AGCTCTACAG GTGAGGCCCA GCAGAGGGAG 

316 TAGGGCTCGC CATGTTTCTG GTGAGCCAAT TTGGCTGATC TTGGGTGTCT GAACAGCTAT 120 
318 TGGGTCCACC CCAGTCCCTT TCAGCTGCTG CTTAATGCCC TGCTCTCTCC CTGGCCCACC 180 
320 TTATAGAGAG CCCAAAGAGC TCCTGTAAGA GGGAGAACTC TATCTGTGGT TTATAATCTT 240 
322 GCACGAGGCA CCAGAAGTCT CCCTGGGTCT TGTGAATGAA CTACATTTAT CCCCTTTCCT 300 
324 GCCCCAACCA CAAACTCTTT CCTTCAAAGA GGGCCTGCCT GGTTCCCTCC ACCCAACTGC 360 
326 ACCATGAGAT CGGTCCAAGA GTCCATTCCC CAGGTGGGAG CCAACTGTCA GGGAGGTCTT 4 20 
328 TCCCACCAAA CATCTTTCAG TTGCTGGGAG GTGACCATAG GGCTCTGCTT TTAAAGATAT 4 80 
3 30 GGCTGCTTCA AAGGCCAGAG TCACAGGAAG GACTTCTTCC AGGGAGATTA GTGGTGATGG 54 0 
332 AGAGGAGAGT TAAAATGACC TCATGTCCTT CTTGTCCACG GTTTTGTTGA GTTTTCACTC 600 
334 TTCTAATGCA AGGGTCTCAC ACTGTGAACC ACTTAGGATG TGATCACTTT CAGGTGGCCA 6 60 
33 6 GGAATGTTGA ATGTCTTTGG CTCAGTTCAT CTAAAAAAGA TATCTATTTG AAAGTTCTCA 720 

33 8 GAGTTGTACA TATGTTTCAC AGTACAGGAT CTGTACATAA AAGTTTCTTT CCTAAACCAT 780 

34 0 TCACCAAGAG CCAATATCTA GGCATTTCCT CGGTAGCACA AATTTTCTNA TTGCTTAGAA 840 
34 2 AATTGTCCTC CCTGTTCTTT CTGTCTGNAG ACTTAAGTGA GTTAGGTCTT TAAGGAAAGC 900 
344 AACGCTCCTC TGAAATGCTT GTCTTTTTTC TGTTGCCGAA ATAGCTGGTC CTTTTTCGGG 960 
34 6 AGTTAGATGT ATAGAGTGTT TGTATGTAAA CATTTCTTGT AGGCATCACC ATGAACANAG 1020 
348 ATATATTTTC TATTTANTTA NTATATGTGC ACTTCAAGAA GTCACTGTCA G AG AAA T AAA 1080 
350 GAATTGTCTT AAATGTCATG ATTGGAGATG TCCTTTGCAT TGCTTGGAAG GGGTGTACCT 1140 
352 AGAGCCAAGG AAATTGGCTC TGGTTTGGAA AAATTTTGCT GTTATTATAG TAAACATACA 1200 
354 AAGGATGTC 

356 (2) INFORMATION FOR SEQ ID NO : 6: 

358 (i) SEQUENCE CHARACTERISTICS: 

359 (A) LENGTH: 548 base pairs 



60 



1209 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/09/988 , 292A 



DATE: 04/21/2003 
TIME: 15:30:58 



Input Set : A:\pfl60d2seqlistcopy .txt 
Output Set: N:\CRF4\04212003\I988292A.raw 



Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:l; Xaa Pos.7,165 
Seq#:2; Xaa Pos.7,16 5 

Seq#:5; N Pos . 829,8 68,1018,1037,1041 
Seq#:6; N Pos. 470,485 

Seq#:8; N Pos. 775,817,844,846,856,857,859,861,863,871,872 
Seq#:21; N Pos . 70,366,388,389,399 

Seq#:22; N Pos. 361,382,385,446,447,469,471,536,581,584,586 
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VERIFICATION SUMMARY 

PATENT APPLICATION : US/09/988 , 292A 



DATE: 04/21/2003 
TIME: 15:30:58 



Input Set : A:\pfl60d2seqlistcopy .txt 
Output Set: N:\CRF4\04212003\I988292A.raw 



L:29 M:220 C: Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
L:30 M:220 C: Keyword misspelled or invalid format, [(B) FILING DATE:] 
L:65 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 1 after pos.:48 
M:341 Repeated in SeqNo=l 

L:124 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 2 after pos . : 0 
M:341 Repeated in SeqNo=2 
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